CDS

Accession Number TCMCG024C56441
gbkey CDS
Protein Id XP_022039481.1
Location complement(join(170988350..170988409,170988925..170989091,170989187..170989292,170989928..170989965,170990060..170990147,170990254..170990307,170991192..170991300,170991375..170991457,170992042..170992104,170992170..170992232,170992327..170992425,170992552..170992590,170992682..170992737,170993227..170993341,170993429..170993598,170993796..170993899,170994619..170994713,170995078..170995160,170995264..170995420))
Gene LOC110942082
GeneID 110942082
Organism Helianthus annuus

Protein

Length 582aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022183789.2
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026        [VIEW IN KEGG]
KEGG_Reaction R04558        [VIEW IN KEGG]
KEGG_rclass RC00010        [VIEW IN KEGG]
RC01190        [VIEW IN KEGG]
RC01943        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01663        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00340        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00340        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGAGGCGGCGACGCTCAATTCACCGCAGTTTACCACCCTTCACGGTCCGCTACGTTCACGGAGGTCTCTACTGAAGTTACACAGCAATAGCATTAGCTTCAATTCTCCGGCAAAGTTCTCCGTTCGTGCCTCCGCCACTGGAGGCGGCGATTCAGCTGTAACTCTGCTTGATTATGGTGCTGGTAATGTTCAGAGTATTAGAAATGCAATTCGGTATCTAGGGCTCGATATCAAAGACGTTCAAACACCAGAGGACATTTTAAATGCTAAACGCCTTATATTTCCTGGGGTTGGAGCGTTTGCTGCAATGATGGATGTGCTAAACCGAAATGGGATGGCTGAAGCACTTCACACATACATAGAGAAAGATCGCCCATTTTTAGGCATTTGTCTTGGTCTGCAACTACTCTTTGAATCAAGTGAAGAAAACGGTCCTGTGAAAGGTCTTGGTGTGATTCCTGGGGTGGTTGGACGTTTCGATTCCTCTAACGGTTGCAGAGTGCCTCATATCGGTTGGAATTCATTGAAAGTTAATAAAGATTCAGTTATTTTAGATGATGTTGCAAATTCCCATGTATATTTTGTTCATTCTTACCGTGCTGTCCCGTCAGAAGAAAACGAAGAGTGGATTTCATCCACTTGCAATTACGGGATTGATTTCATATCATCTATTAGAAGGGGAAATGTACACGCGGTTCAGTTTCACCCGGAGAAAAGTGGAGATGTTGGTCTTTCAATACTGCGGAAGTTCTTGTTACCAAATTCTTCCATAACTAAGAAGCCGTTGGAAGGGAAGGCTACAAAGCTCGCAAAGAGGGTAATTGCTTGCCTTGATGTGAGAACGAATGATAACGGTGATCTTGTTGTTACTAAAGGAGACCAATATGACGTGAGAGAACAGACCAAAGATAATGAGGTGAGGAACCTGGGTAAACCAGTTGAACTTGCCGGACAGTATTACATAGATGGAGCTGATGAGGTTAGCTTTTTAAATATTACAGGGTTTCGTGATTTTCCCCTGGGTGATTTGCCAATGTTGCAGATCTTGAGGTACACGTCAGAGAATGTTTTTGTACCACTAACAGTTGGTGGTGGTATTCGAGATTTCACCGATGCAAATGGCAGATACTATTCTAGTTTGGAAGTAGCTTCAGAATATTTCAGATCTGGTGCAGATAAGATTTCTATTGGAAGTGATGCTGTTTATGCTGCCGAAGAATATCTAAAAACAGGGATAAAAACTGGAAAGAGCAGCTTAGAACAAATATCCAGAGTCTATGGGAATCAGGCAGTGGTAGTAAGCATTGACCCTCGTAGACAATATTTGACCAGTCCTTATGAGGTCGGATTCAAATCCGTTAAAGTAAGCAACTTAGGACCAAATGGTGAAGAGTATGCATGGTATCAGTGCACGGTGAATGGTGGACGAGAGGGTCGACCAATTGGTGCTTATGAGCTGGCAAAAGCTGTTGAAGAATTGGGAGCTGGGGAGATACTGCTGAACTGTATCGACTGTGATGGTCAAGGACAGGGATTTGATATCGATTTGATAAAGCTAATTTCTGATGCTGTGAGCATTCCTGTAATTGCGAGTAGTGGTGCTGGAAAAGCAGAACATTTTTCGGAAGTTTTTTCAGAAACAAATGCTTCTGCAGCTCTTGCTGCTGGCATTTTTCATAGGAAAGAGGTACCTATCCAGTCTGTAAAAGACCATCTATTAAAGAAAGGAATTGAAGTAAGGATATAG
Protein:  
MEAATLNSPQFTTLHGPLRSRRSLLKLHSNSISFNSPAKFSVRASATGGGDSAVTLLDYGAGNVQSIRNAIRYLGLDIKDVQTPEDILNAKRLIFPGVGAFAAMMDVLNRNGMAEALHTYIEKDRPFLGICLGLQLLFESSEENGPVKGLGVIPGVVGRFDSSNGCRVPHIGWNSLKVNKDSVILDDVANSHVYFVHSYRAVPSEENEEWISSTCNYGIDFISSIRRGNVHAVQFHPEKSGDVGLSILRKFLLPNSSITKKPLEGKATKLAKRVIACLDVRTNDNGDLVVTKGDQYDVREQTKDNEVRNLGKPVELAGQYYIDGADEVSFLNITGFRDFPLGDLPMLQILRYTSENVFVPLTVGGGIRDFTDANGRYYSSLEVASEYFRSGADKISIGSDAVYAAEEYLKTGIKTGKSSLEQISRVYGNQAVVVSIDPRRQYLTSPYEVGFKSVKVSNLGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGQGFDIDLIKLISDAVSIPVIASSGAGKAEHFSEVFSETNASAALAAGIFHRKEVPIQSVKDHLLKKGIEVRI